ESTIMATION OF THE HURST PARAMETER FROM
DISCRETE NOISY DATA 

By Arnaud Gloter and Marc Hoffmann 

Université de Marne-la-Vallée

We estimate the Hurst parameter H of a fractional Brownian motion from
discrete noisy data observed along a high frequency sampling scheme. The
presence of systematic experimental noise makes recovery of H more difficult
since relevant information is mostly contained in the high frequencies of
the signal.

We quantify the difficulty of the statistical problem in a min-max sense:
we prove that the rate $n^{-1/(4H+2)}$ is optimal for estimating H and
propose rate optimal estimators based on adaptive estimation of quadratic
functionals.

1. Introduction. 

1.1. Motivation. Many processes of interest in physics, molecular bio- 
logy, finance and traffic networks possess, or are suspected to possess, self- 
similar properties. In this context, recovering the so-called scaling exponents 
from experimental data is a challenging problem. The purpose of this paper 
is to investigate a new statistical method for estimating self-similarity based 
on adaptive estimation of quadratic functionals of the noisy data by wavelet 
thresholding. 

We stay with dimension 1 and focus on the paradigmatic example of 
fractional Brownian motion. 

1.2. Statistical model. Let X be a one-dimensional process of the form 

$X_t = \sigma W_t^H,$

where $W^H$ is a fractional Brownian motion with self-similar index (or Hurst
parameter) $H \in (0,1)$ and scaling parameter $\sigma \in (0,+\infty)$. In
particular, X is centered Gaussian with covariance $E[X_s X_t]$ proportional
to $|t|^{2H} + |s|^{2H} - |t-s|^{2H}$; see more in Section 4.1 below.

In practice, it is unrealistic to assume that a sample path of X can be
observed (in which case the parameters H and $\sigma$ would be identified).
Instead, X is rather observed at discrete times. The problem of estimating H
and $\sigma$ in this context has been given considerable attention (some
references are Dahlhaus [5], Istas and Lang [15] and Ludeña [18]).

In this paper we take the next logical step: we assume that each observation
is contaminated by noise, so that for $i = 0, \ldots, n$ we observe

(1) $Y_i^n = X_{i\Delta} + a(X_{i\Delta})\,\xi_i^n,$

where the $\xi_i^n$ are (centered) noise terms and $\Delta^{-1}$ is the
sampling frequency. The function $x \mapsto a(x)$ is an unknown nuisance
parameter.

Throughout, we assume that the experiment lives over a fixed time horizon
[0,T], so we have $T = n\Delta$. With no loss of generality we take T = 1,
hence $\Delta = \Delta_n = n^{-1}$. Recovering the Hurst parameter H from the
data $(Y_i^n)$ is our objective.
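
As an illustration of the observation scheme (1), the following sketch
simulates a path of X on the grid i/n and its noisy observations. It is only
a sketch under stated assumptions: the fractional Brownian motion is
normalized so that its variance at time 1 equals 1 (the text only fixes the
covariance up to a constant), the noise is taken i.i.d. standard Gaussian,
and the helper names `fbm_covariance`, `simulate_noisy_fbm` and the constant
noise level 0.1 are hypothetical choices made for the example.

```python
import numpy as np

def fbm_covariance(times, hurst):
    """Covariance 0.5 * (|t|^2H + |s|^2H - |t - s|^2H) of a standard fBm."""
    t = np.asarray(times, dtype=float)
    s, u = np.meshgrid(t, t, indexing="ij")
    return 0.5 * (np.abs(s) ** (2 * hurst) + np.abs(u) ** (2 * hurst)
                  - np.abs(s - u) ** (2 * hurst))

def simulate_noisy_fbm(n, hurst, sigma, noise_level, rng):
    """Return (X, Y) on the grid i/n, i = 0..n, in the spirit of model (1)."""
    grid = np.arange(1, n + 1) / n          # X_0 = 0 is appended separately
    cov = fbm_covariance(grid, hurst)
    # A tiny jitter keeps the Cholesky factorisation numerically stable.
    chol = np.linalg.cholesky(cov + 1e-12 * np.eye(n))
    x = np.concatenate(([0.0], sigma * chol @ rng.standard_normal(n)))
    xi = rng.standard_normal(n + 1)         # assumption: i.i.d. N(0, 1) noise
    y = x + noise_level(x) * xi             # Y_i = X_{i/n} + a(X_{i/n}) xi_i
    return x, y

rng = np.random.default_rng(0)
x, y = simulate_noisy_fbm(n=256, hurst=0.7, sigma=1.0,
                          noise_level=lambda v: 0.1 * np.ones_like(v), rng=rng)
```

The Cholesky approach costs O(n^3) operations and is only meant for small n;
it is used here because it follows the covariance formula directly.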

1.3. Results. We show in Theorems 1 and 2 below that the rate

$v_n(H) = n^{-1/(4H+2)}$

is optimal for estimating H. The accuracy $v_n(H)$ is slower by a polynomial
order than the usual $n^{-1/2}$ obtained in the absence of noise. The
difficulty lies in the fact that the information about H is contained in the
high frequencies of the signal $t \mapsto X_t$. Although the high frequency
sampling rate n usually allows one to recover H at the classical rate
$n^{-1/2}$ when X is directly observed (by means of quadratic variations;
see, e.g., [15]), the presence of the noise in this context significantly
alters the nature of the problem.
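
To give a feeling for the size of the loss, the two rates can be compared
numerically. The sketch below simply evaluates the formulas quoted above;
the function names are hypothetical and the sample size is chosen so that
the arithmetic is transparent.

```python
def noisy_rate(n, hurst):
    """Rate n^(-1/(4H+2)) for estimating H from noisy data."""
    return n ** (-1.0 / (4.0 * hurst + 2.0))

def classical_rate(n):
    """Rate n^(-1/2) available when X is observed without noise."""
    return n ** -0.5

n = 2 ** 10
slow = noisy_rate(n, 0.75)    # 4H + 2 = 5, so this is close to 2**-2
fast = classical_rate(n)      # 2**-5
```

With n = 1024 and H = 0.75, the noisy rate is about 0.25 while the classical
rate is about 0.031, which illustrates the polynomial gap between the two.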

2. Main results. 

2.1. Methodology. The parameters $(H,\sigma)$ live in
$\mathcal{D} \subset (0,1) \times (0,+\infty)$. The process X and the noise
variables $(\xi_i^n)$ are simultaneously defined on a common probability
space endowed with a probability measure $\mathbb{P}^n_{H,\sigma}$ for each
$n \geq 1$.

A rate $v_n \to 0$ is said to be achievable over $\mathcal{D}$ if there
exists a (sequence of) estimator(s) $\hat{H}_n$ such that the (sequence of)
normalized error(s)

(2) $v_n^{-1}(\hat{H}_n - H)$

is bounded in $\mathbb{P}^n_{H,\sigma}$-probability uniformly over
$\mathcal{D}$. The rate $v_n$ is said to be a lower rate of convergence over
$\mathcal{D}$ if there exists $c > 0$ such that

(3) $\liminf_{n\to\infty}\ \inf_{\hat{H}}\ \sup_{(H,\sigma)\in\mathcal{D}} \mathbb{P}^n_{H,\sigma}[v_n^{-1}|\hat{H} - H| \geq c] > 0,$

where the infimum is taken over all estimators $\hat{H}$ that are random
variables measurable with respect to the sigma-field generated by the data
$(Y_i^n)$.
2.2. The estimation strategy. The fact that X is a fractional Brownian
motion enables one to predict that its energy levels

(4) $Q_j := \sum_k d_{j,k}^2 := \sum_k \left( \int_{\mathbb{R}} X_s \psi_{j,k}(s)\,ds \right)^2$

scale (as for the approximation symbol $\sim$, we do not yet specify it; see
Proposition 1 below) with a ratio related to H,

(5) $Q_{j+1} \sim 2^{-2H} Q_j,$

up to an error term that vanishes as the frequency level j increases. Here,
$d_{j,k}$ is the random wavelet coefficient of the function $t \mapsto X_t$
relative to a certain wavelet basis
$(\psi_{j,k}, j \geq 0, k \in \mathbb{Z})$. In Section 3.2 below we construct
a procedure

(6) $(Y_i^n) \to (\widehat{d^2_{j,k,n}},\ k = 0,\ldots,2^j - 1,\ 0 \leq j \leq J_n)$

that processes the data into estimates of the squared wavelet coefficients
$d_{j,k}^2$ up to the maximal resolution level
$J_n = [\frac{1}{2}\log_2(n)]$. We obtain a family of estimators for H by
setting

$\hat{H}_{j,n} := -\frac{1}{2}\log_2 \frac{\hat{Q}_{j+1,n}}{\hat{Q}_{j,n}}, \quad j = 1,\ldots,J_n - 1,$

with

$\hat{Q}_{j,n} := \sum_k \widehat{d^2_{j,k,n}}.$


The ratio level j between two estimated energy levels that contains maximal 
information about H is chosen by means of a block thresholding rule; see 
below. The rule is inspired by the methodology introduced for the adaptive 
estimation of quadratic functionals (see, among others, Efromovich and Low 
[7], Gayraud and Tribouley [9] and the references therein). 
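
The mapping from two consecutive energy levels to an estimate of H can be
sketched in a few lines. The example below is purely illustrative: it uses
idealised energies that satisfy the scaling (5) exactly, and the function
name `hurst_from_energies` is a hypothetical helper, not part of the
procedure of Section 3.

```python
import math

def hurst_from_energies(q_next, q_curr):
    """Return -0.5 * log2(Q_{j+1} / Q_j) for two positive energy levels."""
    if q_next <= 0 or q_curr <= 0:
        raise ValueError("energy levels must be positive")
    return -0.5 * math.log2(q_next / q_curr)

# Idealised energies obeying Q_{j+1} = 2^(-2H) Q_j exactly, with H = 0.7.
true_h = 0.7
q = [2.0 ** (-2 * true_h * j) for j in range(1, 6)]
estimates = [hurst_from_energies(q[j + 1], q[j]) for j in range(len(q) - 1)]
```

With real data the energies are estimated and may be negative after bias
correction, which is one reason why the level j has to be selected with
care.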



2.3. Statement of the results. We consider for $(H,\sigma)$ regions of the
form

(7) $\mathcal{D} := [H_-,H_+] \times [\sigma_-,\sigma_+] \subset (\tfrac{1}{2},1) \times (0,+\infty).$

Assumption A. (i) The function $x \mapsto a(x)$ is bounded and continuously
differentiable with a bounded derivative.

(ii) The continuous time process X is $\mathbb{F}^n$-adapted with respect to
a filtration $\mathbb{F}^n = (\mathcal{F}^n_t, t \geq 0)$.
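(iii) The noise term $\xi_i^n$ at time i/n is
$\mathcal{F}^n_{(i+1)/n}$-measurable. Moreover,

$\mathbb{E}^n_{H,\sigma}[\xi_i^n \mid \mathcal{F}^n_{i/n}] = 0, \quad \mathbb{E}^n_{H,\sigma}[(\xi_i^n)^2 \mid \mathcal{F}^n_{i/n}] = 1$

and

$\sup_{(H,\sigma)\in\mathcal{D}}\ \sup_{i,n}\ \mathbb{E}^n_{H,\sigma}[(\xi_i^n)^4] < +\infty.$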

Theorem 1. Grant Assumption A. The rate $v_n(H) := n^{-1/(4H+2)}$ is
achievable for estimating H over any region $\mathcal{D}$ of the form (7).
Moreover, the estimator constructed in Section 3 and given by (9)-(11) below
achieves the rate $v_n(H)$.

This rate is indeed optimal as soon as the noise process enjoys some 
regularity: 

Assumption B. (i) $\inf_x a(x) > 0$.

(ii) Conditional on X, the variables $\xi_i^n$ are independent, absolutely
continuous with densities $x \mapsto \exp(-v_{i,n}(x))$ vanishing at infinity
(together with their derivatives) at a rate strictly faster than $1/x^2$ and

(8) $\sup_{i,n} \mathbb{E}\left[\left(\frac{d}{dx}v_{i,n}(\xi_i^n)\right)^2 (1 + |\xi_i^n|^2)\right] < +\infty.$

Moreover, the functions $x \mapsto \frac{d^2}{dx^2}v_{i,n}(x)$ are Lipschitz
continuous, with Lipschitz constants independent of i,n.

Theorem 2. Grant Assumptions A and B. For estimating H, the rate
$v_n(H) := n^{-1/(4H+2)}$ is a lower rate of convergence over any region
$\mathcal{D}$ of the form (7) with nonempty interior.

We complete this section by giving an ancillary result about the estimation
of the scaling parameter $\sigma$, although we are primarily interested in
recovering H. The estimation of $\sigma$ has been addressed by Gloter and
Jacod [12] for the case H = 1/2 and by Gloter and Hoffmann [10] in a slightly
different model when H > 1/2 is known. Altogether, the rate $v_n(H)$ is
proved to be optimal for estimating $\sigma$ when H is known. Our next result
shows that we lose a logarithmic factor when H is unknown.

Theorem 3. Grant Assumptions A and B. For estimating $\sigma$, the rate
$n^{-1/(4H+2)}\log(n)$ is a lower rate of convergence over any region
$\mathcal{D}$ of the form (7).
2.4. Discussion.

2.4.1. About the rate. We see that the presence of noise dramatically alters
the accuracy of estimation of the Hurst parameter: the optimal rate
$v_n(H) = n^{-1/(4H+2)}$ inflates by a polynomial order as H increases. In
particular, the classical (parametric) rate $n^{-1/2}$ is obtained by
formally letting H tend to 0 (a case we do not have here).

2.4.2. About Theorem 1. The restriction H > 1/2 is linked to the
discretization effect of the estimator. Assumption A can easily be fulfilled in
the case of a noise process that is independent of the signal X. It is not min- 
imal: more general noise processes could presumably be considered, and, 
more interestingly, scaling processes more general than fractional Brownian 
motion as well. To this end, it is required that the energy levels of X satisfy 
Proposition 1 and that the empirical energy levels satisfy Proposition 2 in 
Section 4 below. We do not pursue that here. See also Lang and Roueff [17]. 

2.4.3. About Theorem 2. The lower bound is local, in the sense that
$\mathcal{D}$ can be taken arbitrarily small in the class specified by (7).
Observe that since the rate $v_n(H)$ depends on the parameter value, the
min-max lower bounds (3) are only meaningful for parameter sets
$\mathcal{D}$ that are concentrated around some given value of H.

Assumption B(ii) is not minimal: it is satisfied, in particular, when the
$\xi_i^n$ are i.i.d. centered Gaussian. More generally, any noise process
would yield the same lower bound as soon as Proposition 4 is satisfied (see
Section 6.1).

2.4.4. The stationary case. Golubev [13] remarked that in the particular
case of i.i.d. Gaussian noise independent of $W^H$, a direct spectral
approach is simpler. Indeed, the observation generated by the
$Y_i^n - Y_{i-1}^n$ becomes stationary Gaussian, and a classical Whittle
estimator will do (Whittle [25] or Dahlhaus [5]). In particular, although
some extra care has to be taken about the approximation in n, such an
approach would certainly prove simpler in that specific context for
obtaining the lower bound.

2.4.5. Quadratic variation alternatives. The estimator constructed in Sec- 
tion 3 can be linked to more traditional quadratic variation methods. Indeed, 
the fundamental energy levels Qj defined in (4) can be obtained from the 
quadratic variation of X in the particular case of the Schauder basis (which 
does not have sufficiently many vanishing moments for our purpose). How- 
ever, the choice of an optimal j remains and we were not able to obtain the 
exact rate of convergence by this approach. 
2.5. Organization of the paper. In Section 3 we give the complete
construction of an estimator $\hat{H}_n$ that achieves the min-max rate
$v_n(H)$. Section 4 explores the properties of the energy levels of X
(Proposition 1), as well as their empirical version (Proposition 2).
Theorem 1 is proved in Section 5. Finally, Sections 6 and 7 are devoted to
the lower bounds. It is noteworthy that the complex stochastic structure of
the model due to the two sources of randomness ($W^H$ and the noise
$\xi^n$) requires particular efforts for the lower bound. Our strategy is
outlined in Section 6: it requires a "coupling" result proved in Section 7.
The proof of supplementary technical results, too long to be detailed here,
may be found in [11].

3. Construction of an estimator. 

3.1. Pick a wavelet basis $(\psi_{j,k}, j \geq 0, k \in \mathbb{Z})$
generated by a mother wavelet $\psi$ with two vanishing moments and compact
support in [0,S], where S is some integer. The basis is fixed throughout
Sections 3-5. Assuming we have estimators $\widehat{d^2_{j,k,n}}$ of the
squared wavelet coefficients, recalling the definition (4) of the energy
levels, we obtain a family of estimators for H by setting

$\hat{H}_{j,n} := -\frac{1}{2}\log_2\frac{\hat{Q}_{j+1,n}}{\hat{Q}_{j,n}}, \quad j = \underline{J},\ldots,J_n - 1,$

with

$\hat{Q}_{j,n} := \sum_{k=0}^{2^{j-1}-1} \widehat{d^2_{j,k,n}},$

where $J_n := [\frac{1}{2}\log_2(n)]$ is the maximum level of detail needed
in our statistical procedure and $\underline{J} := [\log_2(S-1)] + 2$ is some
(irrelevant) minimum level introduced to avoid border effects while
computing wavelet coefficients corresponding to location on [0,1/2] from
observations corresponding to [0,1]. Following Gayraud and Tribouley [9] in
the context of adaptive estimation of quadratic functionals, we let

(9) $J_n^\star := \max\{j = \underline{J},\ldots,J_n :\ \hat{Q}_{j,n} \geq 2^j/n\}$

(and in the case where the set above is empty, we let
$J_n^\star = \underline{J}$ for definiteness). Eventually, our estimator of H
is

(10) $\hat{H}_{J_n^\star,n}.$

The performance of $\hat{H}_{J_n^\star,n}$ is related to scaling properties
of X and the accuracy of the procedure (6).
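
The selection rule (9) is a plain thresholding of the estimated energies.
The sketch below implements it for energies stored in a dictionary keyed by
the level j; the function name `select_level` and the numerical values are
hypothetical and only serve to make the rule concrete.

```python
def select_level(q_hat, n, j_min):
    """Largest level j >= j_min with q_hat[j] >= 2**j / n, or j_min if none.

    q_hat maps each level j to an estimated energy; n is the sample size.
    """
    admissible = [j for j, q in q_hat.items()
                  if j >= j_min and q >= 2 ** j / n]
    return max(admissible) if admissible else j_min

# Thresholds 2**j / n for n = 1024: j=2 -> 0.0039, j=3 -> 0.0078,
# j=4 -> 0.0156, j=5 -> 0.0313, so levels 2 and 3 pass and 4, 5 fail.
q_hat = {2: 0.1, 3: 0.02, 4: 0.01, 5: 0.001}
j_star = select_level(q_hat, n=1024, j_min=2)
```

Keeping the estimated energy above 2^j/n is what later prevents the
denominator of the ratio estimator from being too small.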
3.2. Preliminary estimation of the $d_{j,k}^2$. For simplicity and with no
loss of generality, we assume from now on that n has the form $n = 2^N$.
Since $\psi$ has compact support in [0,S], the wavelet coefficient $d_{j,k}$
is

$d_{j,k} = \sigma \int_{k/2^j}^{(k+S)/2^j} \psi_{j,k}(t) W_t^H\,dt.$

This suggests the approximation

$\hat{d}_{j,k,n} := \sum_{l=0}^{S2^{N-j}-1} \left( \int_{k/2^j+l/2^N}^{k/2^j+(l+1)/2^N} \psi_{j,k}(t)\,dt \right) Y^n_{k2^{N-j}+l}$

for $\underline{J} \leq j \leq J_n$, $0 \leq k \leq 2^{j-1} - 1$. The
difference $\hat{d}_{j,k,n} - d_{j,k}$ splits into $b_{j,k,n} + e_{j,k,n}$,
respectively a bias term and a centered noise term,

$b_{j,k,n} = -\sum_{l=0}^{S2^{N-j}-1} \int_{k/2^j+l/2^N}^{k/2^j+(l+1)/2^N} \psi_{j,k}(t)(X_t - X_{k/2^j+l/2^N})\,dt,$

$e_{j,k,n} = \sum_{l=0}^{S2^{N-j}-1} \left( \int_{k/2^j+l/2^N}^{k/2^j+(l+1)/2^N} \psi_{j,k}(t)\,dt \right) a(X_{k/2^j+l/2^N})\,\xi^n_{k2^{N-j}+l}.$

We denote by $v_{j,k,n}$ the variance of $e_{j,k,n}$, conditional on
$\mathcal{F}^n_{k2^{-j}}$, which is equal to

$v_{j,k,n} = \sum_{l=0}^{S2^{N-j}-1} \left( \int_{k/2^j+l/2^N}^{k/2^j+(l+1)/2^N} \psi_{j,k}(t)\,dt \right)^2 \mathbb{E}^n_{H,\sigma}[a(X_{k/2^j+l/2^N})^2 \mid \mathcal{F}^n_{k2^{-j}}].$

The conditional expectations appearing in this expression are close to
$a(X_{k/2^j})^2$ and thus may be estimated from the observations without the
knowledge of H, $\sigma$. We define

$\hat{a}^2_{k/2^j,n} := 2^{-N/2} \sum_{l'=1}^{2^{N/2}} (Y^n_{k2^{N-j}+l'})^2 - \left( 2^{-N/2} \sum_{l'=1}^{2^{N/2}} Y^n_{k2^{N-j}+l'} \right)^2$

and we set

$\hat{v}_{j,k,n} = \sum_{l=0}^{S2^{N-j}-1} \left( \int_{k/2^j+l/2^N}^{k/2^j+(l+1)/2^N} \psi_{j,k}(t)\,dt \right)^2 \hat{a}^2_{k/2^j,n}.$

Eventually, we set

(11) $\widehat{d^2_{j,k,n}} := (\hat{d}_{j,k,n})^2 - \hat{v}_{j,k,n}$

and $\hat{H}_{J_n^\star,n}$ is well defined. We remark that if the function a
is assumed known, one can considerably simplify the construction of the
approximation $\hat{a}^2_{k/2^j,n}$.
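4. The behavior of the energy levels. We denote by $\mathbb{P}_{H,\sigma}$
the law of $X = \sigma W^H$, defined on an appropriate probability space. We
recall the expression of the energy at level j,

$Q_j = \sum_{k=0}^{2^{j-1}-1} d_{j,k}^2.$

Proposition 1. (i) For all $\varepsilon > 0$, there exists
$r_-(\varepsilon) \in (0,1)$ such that

(12) $\inf_{(H,\sigma)\in\mathcal{D}} \mathbb{P}_{H,\sigma}\left[\inf_{j\geq 1} 2^{2jH}Q_j \geq r_-(\varepsilon)\right] \geq 1 - \varepsilon.$

(ii) The sequence

(13) $Z_j := 2^{j/2} \sup_{l \geq j} \left| \frac{Q_{l+1}}{Q_l} - 2^{-2H} \right|$

is bounded in $\mathbb{P}_{H,\sigma}$-probability, uniformly over
$\mathcal{D}$, as $j \to +\infty$.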

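Proposition 2. Let $j_n(H) := [\frac{1}{2H+1}\log_2(n)]$. Then
$J_n \geq j_n(H)$ for all $H \in [H_-,H_+]$, and for any L > 0, the sequence

$n 2^{j_n(H)/2} \sup_{J_n \geq j \geq j_n(H)-L} 2^{-j}|\hat{Q}_{j,n} - Q_j|$

is bounded in $\mathbb{P}^n_{H,\sigma}$-probability, uniformly over
$\mathcal{D}$, as $n \to \infty$.

We shall see below that Propositions 1 and 2 together imply Theorem 1.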

4.1. Fractional Brownian motion. The fractional Brownian motion admits the
harmonizable representation

$W_t^H = \int_{\mathbb{R}} \frac{e^{it\xi} - 1}{(i\xi)^{H+1/2}}\,B(d\xi),$

where B is a complex Gaussian measure (Samorodnitsky and Taqqu [21]).
Another representation using a standard Brownian motion B on the real line
is given by

$W_t^H = \frac{1}{\Gamma(H+1/2)} \int_{-\infty}^{\infty} [(t-s)_+^{H-1/2} - (-s)_+^{H-1/2}]\,dB_s$

($\Gamma$ is the Euler function). The process $W^H$ is H self-similar and the
covariance structure of $W^H$ is explicitly given by

$\mathrm{Cov}(W_s^H, W_t^H) = \frac{\kappa(H)}{2}\{|t|^{2H} + |s|^{2H} - |t-s|^{2H}\},$
where $\kappa(H) = \pi/(H\Gamma(2H)\sin(\pi H))$. Recall that
$d_{j,k} = \int_{\mathbb{R}} \psi_{j,k}(s)X_s\,ds$ denotes the random wavelet
coefficients of X, given a wavelet $\psi$ with two vanishing moments. It can
be seen, using the stationarity of the increments of $W^H$, that, for a
fixed level j, the sequence $(d_{j,k})_{k\in\mathbb{Z}}$ is centered Gaussian
and stationary with respect to the location parameter k. Moreover, the
coefficients have the self-similarity property

$(d_{j,k})_{k\in\mathbb{Z}} \stackrel{\mathrm{law}}{=} 2^{-j(H+1/2)}(d_{0,k})_{k\in\mathbb{Z}};$

see Delbeke and Abry [6], Veitch and Abry [23], Abry, Gonçalves and
Flandrin [1] and Veitch, Taqqu and Abry [24]. Moreover,

$\mathrm{Var}(d_{j,k}) = \sigma^2 c(\psi)\kappa(H)2^{-j(1+2H)},$

where
$c(\psi) = \frac{1}{2}\int\!\!\int \psi(s)\psi(t)\{|t|^{2H} + |s|^{2H} - |t-s|^{2H}\}\,ds\,dt$,
and the covariance

$\mathrm{Cov}(d_{j,k},d_{j,k'}) = 2^{-j(2H+1)}\mathrm{Cov}(d_{0,k},d_{0,k'})$

decays polynomially as $k - k' \to \infty$ due to the two vanishing moments
of $\psi$ and

$|\mathrm{Cov}(d_{0,k},d_{0,k'})| \leq c(1 + |k-k'|)^{2(H-2)}$

for some c which does not depend on $\sigma$ or H. See also Tewfik and
Kim [22], Hirchoren and D'Attellis [14], Istas and Lang [15] and Gloter and
Hoffmann [10].



Proposition 3. We have, for some constant c > 0,

$\sup_{(H,\sigma)\in\mathcal{D}} \mathbb{E}_{H,\sigma}\left[\left( Q_j - 2^{-2jH}\frac{\sigma^2}{2}c(\psi)\kappa(H) \right)^2\right] \leq c2^{-j(1+4H)}.$

Proof. Remark that, by stationarity,

$Q_j - 2^{-2jH}\frac{\sigma^2}{2}c(\psi)\kappa(H) = \sum_{k=0}^{2^{j-1}-1} (d_{j,k}^2 - \mathbb{E}_{H,\sigma}[d_{j,k}^2]).$

Then the variance of the sum above is evaluated using the decorrelation
property of the wavelet coefficients (similar computations can be found in
Istas and Lang [15] and Gloter and Hoffmann [10]). □



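4.2. Proof of Proposition 1. By Proposition 3, we derive in the same way as
in Lemma II.4 of Ciesielski, Kerkyacharian and Roynette [4] that, for all
$\varepsilon > 0$,

$\sum_{j\geq 0}\ \sup_{(H,\sigma)\in\mathcal{D}} \mathbb{P}_{H,\sigma}\left[\left| 2^{2jH}Q_j - \frac{\sigma^2}{2}c(\psi)\kappa(H) \right| \geq \varepsilon\right] < \infty,$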
from which (i) easily follows. By (i), the probability that $|Z_j|$ is
greater than a constant M is less than

(14) $\varepsilon + \mathbb{P}_{H,\sigma}\left[\sup_{l\geq j} |Q_{l+1} - 2^{-2H}Q_l|\,2^{2lH} \geq M2^{-j/2}r_-(\varepsilon)\right].$

By self-similarity, $\mathbb{E}_{H,\sigma}\{Q_{l+1} - 2^{-2H}Q_l\} = 0$. By
Markov's inequality, (14) is less than

$\varepsilon + [Mr_-(\varepsilon)]^{-2} \sum_{l\geq j} \mathrm{Var}_{H,\sigma}(Q_{l+1} - 2^{-2H}Q_l)\,2^{4lH}2^{j}.$

By Proposition 3, the sum above can be made arbitrarily small for large
enough M, which proves (ii).

4.3. Proof of Proposition 2. We first claim that the following estimate
holds:

(15) $\sup_{J_n \geq j \geq j_n(H)-L}\ \sup_{(H,\sigma)\in\mathcal{D}} 2^{-j/2}\,\mathbb{E}^n_{H,\sigma}[|\hat{Q}_{j,n} - Q_j|] \leq cn^{-1}.$

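Proposition 2 readily follows. To prove (15), we first split
$\hat{Q}_{j,n} - Q_j$ into $\sum_{u=1}^{6} r^{(u)}_{j,n}$, with

$r^{(1)}_{j,n} = \sum_k b_{j,k,n}^2, \quad r^{(2)}_{j,n} = \sum_k (e_{j,k,n}^2 - v_{j,k,n}), \quad r^{(3)}_{j,n} = \sum_k (v_{j,k,n} - \hat{v}_{j,k,n}),$

$r^{(4)}_{j,n} = 2\sum_k e_{j,k,n}d_{j,k}, \quad r^{(5)}_{j,n} = 2\sum_k b_{j,k,n}d_{j,k}, \quad r^{(6)}_{j,n} = 2\sum_k b_{j,k,n}e_{j,k,n}.$

Using the result that
$\mathbb{E}_{H,\sigma}[(X_t - X_s)^2] \leq c(H)\sigma^2|t-s|^{2H}$, it is
readily seen that $\mathbb{E}^n_{H,\sigma}[(b_{j,k,n})^2]$ is less than a
constant times $2^{-j}n^{-2H}$. Summing over k shows that the term
$r^{(1)}_{j,n}$ is negligible since H > 1/2.

Using the fact that $e^2_{j,k,n} - v_{j,k,n}$ are uncorrelated for
$|k-k'| \geq S$, we deduce that
$\mathbb{E}^n_{H,\sigma}[(r^{(2)}_{j,n})^2]$ is bounded by a constant times
$\sum_{k=0}^{2^{j-1}-1} \{\mathbb{E}^n_{H,\sigma}[e^4_{j,k,n}] + \mathbb{E}^n_{H,\sigma}[v^2_{j,k,n}]\}$.
Then using the martingale increments structure of the sequence
$a(X_{k2^{-j}+l2^{-N}})\,\xi^n_{k2^{N-j}+l}$, $l = 0,\ldots,S2^{N-j}$ (recall
that $n = 2^N$), we may apply the Burkholder-Davis inequality. This gives,
by Assumption A, $\mathbb{E}^n_{H,\sigma}[e^4_{j,k,n}] \leq cn^{-2}$. Then
since $x \mapsto a(x)$ is bounded and, thus, $v_{j,k,n} \leq cn^{-1}$, we
obtain that $r^{(2)}_{j,n}$ has the right order $2^{j/2}n^{-1}$.

Using conditional centering of $e_{j,k,n}$ with the fact that the variance
of $d_{j,k}$ is less than $c2^{-j(1+2H)}$ and the condition
$j \geq j_n(H) - L = [\frac{1}{2H+1}\log_2(n)] - L$, one easily checks that
the terms $r^{(4)}_{j,n}$, $r^{(5)}_{j,n}$ and $r^{(6)}_{j,n}$ have
negligible order.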
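We finally turn to the important term $r^{(3)}_{j,n}$, which encompasses the
estimation of a. We claim that, for $0 \leq l \leq S2^{N-j} - 1$, the
following estimate holds:

(16) $\mathbb{E}^n_{H,\sigma}\left[\left| \hat{a}^2_{k/2^j,n} - \mathbb{E}^n_{H,\sigma}[a(X_{k/2^j+l/2^N})^2 \mid \mathcal{F}^n_{k2^{-j}}] \right|\right] \leq cn^{-1/4}.$

Summing over l and k yields the result for $r^{(3)}_{j,n}$, as soon as (16)
is proved. Indeed, since $\hat{v}_{j,k,n} - v_{j,k,n}$ is equal to

$\sum_{l=0}^{S2^{N-j}-1} \left( \int_{k/2^j+l/2^N}^{k/2^j+(l+1)/2^N} \psi_{j,k}(t)\,dt \right)^2 \left( \hat{a}^2_{k/2^j,n} - \mathbb{E}^n_{H,\sigma}[a(X_{k/2^j+l/2^N})^2 \mid \mathcal{F}^n_{k2^{-j}}] \right),$

we have that $\mathbb{E}^n_{H,\sigma}[|r^{(3)}_{j,n}|]$ is less than a
constant times $2^{j}n^{-1}n^{-1/4}$. Therefore, under the restriction
$j \leq J_n \leq [\frac{1}{2}\log_2(n)]$, (15) holds. It remains to prove
(16).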
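We have
$\hat{a}^2_{k/2^j,n} - \mathbb{E}^n_{H,\sigma}[a(X_{k/2^j+l/2^N})^2 \mid \mathcal{F}^n_{k2^{-j}}] = t^{(1)}_{k,n} + t^{(2)}_{k,l,n} + t^{(3)}_{k,n}$
with

$t^{(1)}_{k,n} = 2^{-N/2}\sum_{l'=1}^{2^{N/2}} X^2_{k/2^j+l'/2^N} - \left( 2^{-N/2}\sum_{l'=1}^{2^{N/2}} Y^n_{k2^{N-j}+l'} \right)^2,$

$t^{(2)}_{k,l,n} = 2^{-N/2}\sum_{l'=1}^{2^{N/2}} a(X_{k/2^j+l'/2^N})^2(\xi^n_{k2^{N-j}+l'})^2 - \mathbb{E}^n_{H,\sigma}[a(X_{k/2^j+l/2^N})^2 \mid \mathcal{F}^n_{k2^{-j}}],$

$t^{(3)}_{k,n} = 2 \cdot 2^{-N/2}\sum_{l'=1}^{2^{N/2}} X_{k/2^j+l'/2^N}\,a(X_{k/2^j+l'/2^N})\,\xi^n_{k2^{N-j}+l'}.$

Since the $\xi^n_{k2^{N-j}+l'}$ are uncorrelated and centered, we readily
have that the expectation of $|t^{(3)}_{k,n}|$ is of order
$2^{-N/4} = n^{-1/4}$. For the term $t^{(2)}_{k,l,n}$, use the preliminary
decomposition

$t^{(2)}_{k,l,n} = 2^{-N/2}\sum_{l'=1}^{2^{N/2}} a(X_{k/2^j+l'/2^N})^2[(\xi^n_{k2^{N-j}+l'})^2 - 1]$

$\qquad + 2^{-N/2}\sum_{l'=1}^{2^{N/2}} \left( a(X_{k/2^j+l'/2^N})^2 - \mathbb{E}^n_{H,\sigma}[a(X_{k/2^j+l/2^N})^2 \mid \mathcal{F}^n_{k2^{-j}}] \right).$

The expectation of the absolute value of the first term above is of order
$n^{-1/4}$ since the summands
$a(X_{k/2^j+l'/2^N})^2[(\xi^n_{k2^{N-j}+l'})^2 - 1]$ are martingale
increments with second-order moments by Assumption A. Likewise, since
$x \mapsto a(x)$ has a bounded derivative and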



EHAiXk/2.+i'/2N - ^fc/2.)'] < c{H)a\2~^/y^, 

k/23+1/2'^ ~ ^k/23 



^HAiXk/2^+i/2^ - ^fc/2. )'} < c{H)a\2-^/^f'' 
(2)

the second term in the expression of ^ ^ has absolute expected value less 
than a constant times {2'^/'^)^ < 2i"(^)^/2 = n~^/^^+^^\ and thus has the 
right order since H > 1/2. 

Finally, we further need to split $t^{(1)}_{k,n}$ into

$2^{-N/2}\sum_{l'=1}^{2^{N/2}} X^2_{k/2^j+l'/2^N} - \left( 2^{-N/2}\sum_{l'=1}^{2^{N/2}} X_{k/2^j+l'/2^N} \right)^2$

$\quad - \left( 2^{-N/2}\sum_{l'=1}^{2^{N/2}} a(X_{k/2^j+l'/2^N})\,\xi^n_{k2^{N-j}+l'} \right)^2$

$\quad - 2\left( 2^{-N/2}\sum_{l'=1}^{2^{N/2}} X_{k/2^j+l'/2^N} \right)\left( 2^{-N/2}\sum_{l'=1}^{2^{N/2}} a(X_{k/2^j+l'/2^N})\,\xi^n_{k2^{N-j}+l'} \right).$

The first term and second term are easily seen to be of the right order,
respectively, by the smoothness property of X and the fact that the
variables $\xi_i^n$ are uncorrelated. The third term is seen to have the
right order after observing that one can replace the first sum
$2^{-N/2}\sum_{l'=1}^{2^{N/2}} X_{k/2^j+l'/2^N}$ by $X_{k/2^j}$ up to a
negligible error and then use the conditional zero correlation of the
$\xi_i^n$ again. Thus, (16) is proved; hence, (15) follows. The proof of
Proposition 2 is complete.
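
The local estimator of the squared noise level used in Section 3.2 and
analysed in the proof above is an empirical variance of the observations
over a short window, followed by the bias correction (11). The sketch below
illustrates both steps on a toy example where the signal is constant over
the window; the helper names `local_noise_variance` and `debiased_square`
and the numerical values are hypothetical.

```python
import numpy as np

def local_noise_variance(y, start, window):
    """Empirical variance of y[start+1 : start+window+1].

    When the signal barely moves over the window, this is a proxy for the
    squared noise level a(X)^2 at the start of the window.
    """
    block = np.asarray(y[start + 1 : start + 1 + window], dtype=float)
    return np.mean(block ** 2) - np.mean(block) ** 2

def debiased_square(d_hat, v_hat):
    """Bias-corrected squared coefficient, in the spirit of (11)."""
    return d_hat ** 2 - v_hat

# Toy data: constant signal 3.0 plus noise of size 0.5 with alternating sign.
y = np.array([3.0, 3.5, 2.5, 3.5, 2.5])
a2_hat = local_noise_variance(y, start=0, window=4)   # recovers 0.5**2
d2_hat = debiased_square(2.0, a2_hat)
```

In this toy case the mean of the squares is 9.25 and the squared mean is 9,
so the estimate equals 0.25; in the paper's setting the window has length
$2^{N/2}$.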

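5. Proof of Theorem 1. First we need the following result that states the
level $J_n^\star$, based on the data, is with large probability greater than
some level based on the knowledge of H.

5.1. A fundamental lemma. For $\varepsilon > 0$, define

(17) $J_n^-(\varepsilon) := \max\{j \geq 1;\ r_-(\varepsilon)2^{-2jH} \geq 2^j/n\}.$

Lemma 1. For all $\varepsilon > 0$, there exists $L(\varepsilon) > 0$ such
that

$\sup_{(H,\sigma)\in\mathcal{D}} \mathbb{P}^n_{H,\sigma}[J_n^\star < J_n^-(\varepsilon) - L(\varepsilon)] \leq \varepsilon + \varphi_n(\varepsilon),$

where $\varphi_n$ satisfies
$\lim_{n\to\infty}\varphi_n(\varepsilon) = 0$.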

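Proof. Let $L, \varepsilon > 0$. By definition of $J_n^-(\varepsilon)$,

$\tfrac{1}{2}r_-(\varepsilon)^{1/(1+2H)}n^{1/(1+2H)} \leq 2^{J_n^-(\varepsilon)} \leq r_-(\varepsilon)^{1/(1+2H)}n^{1/(1+2H)};$

hence, for large enough n, we have
$\underline{J} \leq J_n^-(\varepsilon) - L \leq J_n$. Thus, by (9),
$\mathbb{P}^n_{H,\sigma}[J_n^\star \geq J_n^-(\varepsilon) - L]$ is greater
than

$\mathbb{P}^n_{H,\sigma}[\hat{Q}_{J_n^-(\varepsilon)-L,n} \geq 2^{J_n^-(\varepsilon)-L}/n],$

which we rewrite as

$\mathbb{P}^n_{H,\sigma}[\hat{Q}_{J_n^-(\varepsilon)-L,n} - Q_{J_n^-(\varepsilon)-L} \geq 2^{J_n^-(\varepsilon)-L}/n - Q_{J_n^-(\varepsilon)-L}]$

and which we bound from below by

$\mathbb{P}^n_{H,\sigma}[\hat{Q}_{J_n^-(\varepsilon)-L,n} - Q_{J_n^-(\varepsilon)-L} \geq 2^{J_n^-(\varepsilon)-L}/n - r_-(\varepsilon)2^{-2(J_n^-(\varepsilon)-L)H}] - \mathbb{P}_{H,\sigma}\left[\inf_{j\geq 1} 2^{2jH}Q_j < r_-(\varepsilon)\right].$

Proposition 1(i) and the definition of $J_n^-(\varepsilon)$ yield that this
last term is greater than

$\mathbb{P}^n_{H,\sigma}[\hat{Q}_{J_n^-(\varepsilon)-L,n} - Q_{J_n^-(\varepsilon)-L} \geq r_-(\varepsilon)^{1/(2H+1)}n^{-2H/(2H+1)}(2^{-L} - 2^{2HL})] - \varepsilon.$

Then, if L is such that $2^{-L} - 2^{2HL} \leq -1$, an assumption we shall
make from now on, Lemma 1 is proved, provided we show that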

can be made arbitrarily small, uniformly in {H,a). Using again 

2J„-(e)> l^l/(2//+l)^_(^)l/(2/f+l)^ 

we can pick L' = L'{e) > independent of n such that 

J-ie)-L>UH)-L'{e). 
Therefore, (18) is less than 



which we rewrite as 



Jn>j>jniH)-L'{e) 



J»>j>MH)-L'{e) 



where 

VH{e,n) := 2^'(^)r„(e) V{2^^+i)„i/{4/^+2) 

and where we use the fact that 2^"^^^ is of order n^/('^^'^^\ We conclude 
by applying Proposition 2, using the fact that, for fixed e > 0, 
2L'{e)j._^^-ji/i2H+i)^i/(4:H+2) ^ ^s n ^ oo. The uniformity in iH,a) is 

straightforward. □ 
5.2. Proof of Theorem 1, completion. Since $t \mapsto 2^{-2t}$ is invertible
on (0,1) with inverse uniformly Lipschitz on the compact sets of (0,1), it
suffices to prove Theorem 1 with $2^{-2H}$ in place of H and
$\hat{Q}_{J_n^\star+1,n}/\hat{Q}_{J_n^\star,n}$ in place of
$\hat{H}_{J_n^\star,n}$. First, we bound

$\left| \frac{\hat{Q}_{J_n^\star+1,n}}{\hat{Q}_{J_n^\star,n}} - 2^{-2H} \right|$

by a "bias" and a variance term, namely,

$\left| \frac{Q_{J_n^\star+1}}{Q_{J_n^\star}} - 2^{-2H} \right| + \left| \frac{\hat{Q}_{J_n^\star+1,n}}{\hat{Q}_{J_n^\star,n}} - \frac{Q_{J_n^\star+1}}{Q_{J_n^\star}} \right| =: |B_n| + |V_n|,$

say. Second, we prove Theorem 1 for $B_n$ and $V_n$ separately. We remark
that the "bias" term $Q_{j+1}/Q_j - 2^{-2H}$ is deterministic, conditional on
X, and decreases as the level j increases, while the variance term
$\hat{Q}_{j+1,n}/\hat{Q}_{j,n} - Q_{j+1}/Q_j$ increases. They both match at
level $j = J_n^-(\varepsilon)$. In contrast to many "bias-variance"
situations, the behavior of the variance term depends on the unknown
regularity of the signal through the rate of decrease of the denominators
$\hat{Q}_{j,n}$ and $Q_j$. This explains the choice made in (9) to control
the estimated level of energy $\hat{Q}_{J_n^\star,n}$ from below.

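5.2.1. The bias term. Let M > 0 and $\varepsilon > 0$. By Lemma 1, we have

$\mathbb{P}^n_{H,\sigma}[n^{1/(4H+2)}|B_n| \geq M]$

$\quad \leq \mathbb{P}^n_{H,\sigma}[n^{1/(4H+2)}|B_n| \geq M,\ J_n^\star \geq J_n^-(\varepsilon) - L(\varepsilon)] + \varepsilon + \varphi_n(\varepsilon)$

$\quad \leq \mathbb{P}_{H,\sigma}[n^{1/(4H+2)}2^{-(J_n^-(\varepsilon)-L(\varepsilon))/2}|Z_{J_n^-(\varepsilon)-L(\varepsilon)}| \geq M] + \varepsilon + \varphi_n(\varepsilon)$

$\quad \leq \mathbb{P}_{H,\sigma}[\sqrt{2}\,r_-(\varepsilon)^{-1/(4H+2)}2^{L(\varepsilon)/2}|Z_{J_n^-(\varepsilon)-L(\varepsilon)}| \geq M] + \varepsilon + \varphi_n(\varepsilon),$

where we have used for the last line the fact that, by (17),

$2^{-J_n^-(\varepsilon)} \leq 2r_-(\varepsilon)^{-1/(2H+1)}n^{-1/(2H+1)}.$

We conclude by Proposition 1(ii) and by taking successively $\varepsilon$
sufficiently small, M sufficiently large and n sufficiently large.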

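5.2.2. The variance term. We split the variance term into
$V_n = V_n^{(1)} + V_n^{(2)}$, where

$V_n^{(1)} := \frac{\hat{Q}_{J_n^\star+1,n} - Q_{J_n^\star+1}}{\hat{Q}_{J_n^\star,n}}, \quad V_n^{(2)} := \frac{Q_{J_n^\star+1}(Q_{J_n^\star} - \hat{Q}_{J_n^\star,n})}{\hat{Q}_{J_n^\star,n}Q_{J_n^\star}}.$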
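Having Lemma 1 in mind, we bound, for any M > 0 and L an integer, the
probability $\mathbb{P}^n_{H,\sigma}[n^{1/(4H+2)}|V_n^{(1)}| \geq M]$ by

$\mathbb{P}^n_{H,\sigma}[n^{1/(4H+2)}|V_n^{(1)}| \geq M,\ J_n^\star \geq J_n^-(\varepsilon) - L] + \mathbb{P}^n_{H,\sigma}[J_n^\star < J_n^-(\varepsilon) - L].$

Fix $\varepsilon > 0$ and pick $L = L(\varepsilon)$ as in Lemma 1 so that the
second probability
$\mathbb{P}^n_{H,\sigma}[J_n^\star < J_n^-(\varepsilon) - L(\varepsilon)]$ is
bounded by $\varepsilon + \varphi_n(\varepsilon)$. It remains now to deal
with the first probability. As soon as n is large enough,
$J_n^-(\varepsilon) - L(\varepsilon) \geq \underline{J}$ and, thus, by
definition of $J_n^\star$, the denominator of $V_n^{(1)}$ is bounded below by
$2^{J_n^\star}/n$. This yields a new bound for the first probability,

$\mathbb{P}^n_{H,\sigma}[n^{1/(4H+2)+1}2^{-J_n^\star}|\hat{Q}_{J_n^\star+1,n} - Q_{J_n^\star+1}| \geq M,\ J_n^\star \geq J_n^-(\varepsilon) - L(\varepsilon)].$

Recall that we defined $j_n(H) = [\frac{1}{2H+1}\log_2(n)]$ in Proposition 2
and by definition of $J_n^-(\varepsilon)$ we have

$2^{J_n^-(\varepsilon)} \geq \tfrac{1}{2}n^{1/(2H+1)}r_-(\varepsilon)^{1/(2H+1)}.$

Therefore, we can pick a positive $L' = L'(\varepsilon)$ independent of n
such that

$J_n^-(\varepsilon) - L(\varepsilon) \geq j_n(H) - L'(\varepsilon),$

and then we can bound the first probability by

$\mathbb{P}^n_{H,\sigma}\left[n^{1/(4H+2)+1} \sup_{J_n \geq j \geq j_n(H)-L'(\varepsilon)} 2^{-j}|\hat{Q}_{j,n} - Q_j| \geq M\right].$

Next, using the fact that $n^{1/(4H+2)+1}$ is of order $n2^{j_n(H)/2}$ and
Proposition 2, this term can be made arbitrarily small (uniformly in n) by
taking M large enough.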

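We now turn to the term $V_n^{(2)}$. Fix $\varepsilon > 0$ and M > 0.
Recalling the definition of $Z_j$ in Proposition 1, we have

$\mathbb{P}^n_{H,\sigma}[n^{1/(4H+2)}|V_n^{(2)}| \geq M] \leq \mathbb{P}^n_{H,\sigma}\left[n^{1/(4H+2)}\frac{|Q_{J_n^\star} - \hat{Q}_{J_n^\star,n}|}{\hat{Q}_{J_n^\star,n}}(2^{-2H} + Z_0) \geq M\right].$

Now the tightness of the sequence $Z_j$ implies that, for some fixed
constant M', this probability is less than

$\mathbb{P}^n_{H,\sigma}\left[n^{1/(4H+2)}\frac{|Q_{J_n^\star} - \hat{Q}_{J_n^\star,n}|}{\hat{Q}_{J_n^\star,n}} \geq \frac{M}{M'}\right] + \varepsilon.$

Then the conclusion follows exactly as for $V_n^{(1)}$. The proof of
Theorem 1 is complete.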






6. Proof of Theorems 2 and 3. Consistently with Section 4, we denote by
$\mathbb{P}_{H,\sigma}$ the probability measure on the Wiener space $C_0$ of
continuous functions on [0,1] under which the canonical process X has the
law $\sigma W^H$. We write $P_f^n$ for the law of the data, conditional on
X = f.



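6.1. Preliminaries. Define, for $\alpha \in (0,1)$ and $f \in C_0$,

(19) $\|f\|_{\mathcal{H}^\alpha} := \|f\|_\infty + \sup_{0\leq s<t\leq 1} \frac{|f(t) - f(s)|}{|t-s|^\alpha},$

with

$\|f\|_\infty := \sup_t |f(t)|.$

The total variation of a signed measure $\mu$ is

$\|\mu\|_{TV} := \sup_{\|f\|_\infty \leq 1} \left| \int f\,d\mu \right|.$

If $\mu$ and $\nu$ are two probability measures, the total variation of
$\mu - \nu$ is maximal when $\mu$ and $\nu$ have disjoint support, in which
case $\|\mu - \nu\|_{TV} = 2$.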

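Proposition 4. Grant Assumptions A and B. We have, for some constant c > 0,

$\|P_f^n - P_g^n\|_{TV} \leq cn^{1/2}\|f - g\|_\infty$

and

$2 - \|P_f^n - P_g^n\|_{TV} \geq R(cn\|f - g\|_2^2 + c\|f\|^2_{\mathcal{H}^{1/2}} + c\|g\|^2_{\mathcal{H}^{1/2}}),$

where R is some universal nonincreasing positive function and

$\|f\|_2 = \left( \int_0^1 f(s)^2\,ds \right)^{1/2}.$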

Proof. Let $D(\mu,\nu) := \int (\log\frac{d\mu}{d\nu})\,d\mu \leq +\infty$
denote the Kullback-Leibler divergence between the probability measures
$\mu$ and $\nu$. We recall the classical Pinsker inequality
$\|\mu - \nu\|_{TV} \leq \sqrt{2}\,D(\mu,\nu)^{1/2}$.
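
Pinsker's inequality, with the normalization of the total variation used
here (values in [0,2]), can be checked numerically on discrete
distributions. The sketch below is only an illustration on hypothetical
probability vectors; it assumes that q is positive wherever p is, so that
the divergence is finite.

```python
import math

def total_variation(p, q):
    """sup over |f| <= 1 of |sum f (p - q)|, i.e. sum |p_i - q_i| in [0, 2]."""
    return sum(abs(a - b) for a, b in zip(p, q))

def kl_divergence(p, q):
    """D(p, q) = sum p_i log(p_i / q_i), with the convention 0 log 0 = 0."""
    return sum(a * math.log(a / b) for a, b in zip(p, q) if a > 0)

p = [0.5, 0.3, 0.2]
q = [0.4, 0.4, 0.2]
tv = total_variation(p, q)                      # 0.1 + 0.1 + 0.0
bound = math.sqrt(2.0 * kl_divergence(p, q))    # Pinsker upper bound
disjoint = total_variation([1.0, 0.0], [0.0, 1.0])
```

Here the total variation is about 0.2 and the Pinsker bound about 0.22,
while two measures with disjoint supports reach the maximal value 2.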

Using Assumption B(ii) and the representation (1), we deduce 



dPl' 



f^o I \a{gi/n) 



this yields the expression for the Kullback-Leibler divergence. 



where „ = e(^7^ - 1) + '"'2, "'(" • By a second-order Taylor expansion, 



D{dPf,dP^)=Y^\¥. 



i=0 



d 

dx 



log 



ajfi/n) 

a{gi/n) 
(20)



1=0 



for some (random) 9i^n £ (0, 1). Using the fact that x ^ exp(— Uj^„(j;)) van- 
ishes at infinity, we have EJ[{£vi,n)m] = and E][{£vi,n){C)^r] = 1, 
integrating by parts. It follows that the terms in the first sum of (20) are 

equal to — 1 — log( "|f''^"h . The assumptions on x a(x) yield that 

this quantity is less than some constant times (/j/„ — Qi/n)'^- 

For the second-order terms, using the uniform Lipschitz assumption on 



^^Vi^nix) , together with the uniform bound for Ej[|^"p], gives 



< C\!iln-9iln? + 



n\ \ 2 



Again, we can bound \K'J[-£jVi^ni^f)AlJ \ by a constant times {fi/n-gi/nf, 
using the result that EJ[£vi,nm], ^[{i7Vi,n{m^?] and E][{i,Vi,nm){^?f] 
are controlled by sup^ „E^{(^'(;i,„(4"))2(l + |4"|2)}. Thus, the divergence 
between the conditional laws is bounded by 

$D(P_f^n, P_g^n) \leq c\sum_{i=0}^{n} |f_{i/n} - g_{i/n}|^2$

and the first part of the proposition follows from Pinsker's inequality. For
the second part of the proposition, we use

$\sum_{i=0}^{n} |f_{i/n} - g_{i/n}|^2 \leq cn\int_0^1 (f(x) - g(x))^2\,dx + cn^{1-2\alpha}(\|f\|^2_{\mathcal{H}^\alpha} + \|g\|^2_{\mathcal{H}^\alpha}),$

valid for any $\alpha \in (0,1)$, together with the fact that for two
measures $\mu,\nu$ the total variation $\|\mu - \nu\|_{TV}$ remains bounded
away from 2 when the divergences $D(\mu,\nu)$ and $D(\nu,\mu)$ are bounded
away from $+\infty$. □

The next result is the key to the lower bound. Its proof is delayed until
Section 7. Let $(\sigma_0,H_0)$ be a point in the interior of $\mathcal{D}$.
Set, for I > 0, $\varepsilon_n := I^{-1}n^{-1/(4H_0+2)}$ and

$H_1 := H_0 + \varepsilon_n, \quad \sigma_1 := \sigma_0 2^{j_0\varepsilon_n},$

where

$j_0 = [\log_2(n^{1/(2H_0+1)})].$
Proposition 5. For I large enough, there exists a sequence of probability
spaces $(\mathcal{X}^n, \mathfrak{X}^n, \mathbb{P}^n)$ on which can be
defined two sequences of stochastic processes,
$(\xi_t^{i,n})_{t\in[0,1]}$, i = 0,1, such that:

(i) For $1/2 < \alpha < H_0$, the sequences
$\|\xi^{0,n}\|_{\mathcal{H}^\alpha}$ and
$\|\xi^{1,n}\|_{\mathcal{H}^\alpha}$ are tight under $\mathbb{P}^n$.

(ii) Define
$P^{i,n} := \int_{\mathcal{X}^n} \mathbb{P}^n(d\omega)P^n_{\xi^{i,n}(\omega)}$,
and $Q^n_{H_i,\sigma_i} = \int \mathbb{P}_{H_i,\sigma_i}(df)P_f^n$, that is,
the law of the data $(Y_i^n)$. Then

$\lim_{n\to\infty} \|P^{i,n} - Q^n_{H_i,\sigma_i}\|_{TV} = 0, \quad i = 0,1.$

(iii) There exists a measurable transformation
$T^n : \mathcal{X}^n \to \mathcal{X}^n$ such that the sequence
$n\|\xi^{1,n}(\omega) - \xi^{0,n}(T^n(\omega))\|_2^2$ is tight under
$\mathbb{P}^n$.

(iv) If n is large enough, the probability measure $\mathbb{P}^n$ and its
image measure $T^n\mathbb{P}^n$ are equivalent on
$(\mathcal{X}^n, \mathfrak{X}^n)$. Moreover, for some $c^\star \in (0,2)$, we
have

$\|\mathbb{P}^n - T^n\mathbb{P}^n\|_{TV} \leq 2 - c^\star < 2,$

provided n is taken large enough.

Remark. The processes $\xi^{0,n}$ and $\xi^{1,n}$ play the role of
approximations for $\sigma_0W^{H_0}$ and $\sigma_1W^{H_1}$, respectively.
Part (i) means that $\xi^{i,n}$ shares the same smoothness property as
$W^{H_i}$, while (ii) implies that observing a noisy discrete sampling of
$\sigma_iW^{H_i}$ (i = 0,1) or of its approximation $\xi^{i,n}$ is
statistically equivalent as $n \to \infty$. Of course, these points
trivially hold in the case $\xi^{0,n} = \sigma_0W^{H_0}$ and
$\xi^{1,n} = \sigma_1W^{H_1}$. However, a significant modification of this
simple choice is needed in order to have the fundamental properties (iii)
and (iv). These properties mean that one can transform pathwise, using
$T^n$, the process $\xi^{0,n}$ into approximate realizations of
$\xi^{1,n}$, while $T^n$ essentially does not transform $\mathbb{P}^n$ into a
measure singular with it.

We next prove that Propositions 4 and 5 together imply Theorems 2 
and 3. 

6.2. Proof of Theorems 2 and 3. We prove Theorem 2 only. The proof of
Theorem 3 is analogous since the choice of $H_1$ and $\sigma_1$ implies that
$\sigma_1 - \sigma_0$ is of order

$\frac{\sigma_0\log(n)\,n^{-1/(2+4H_0)}}{I(1+2H_0)}.$

Pick n large enough so that $(\sigma_1,H_1) \in \mathcal{D}$. Pick an
arbitrary estimator $\hat{H}_n$. Let M > 0, with M < 1/2I for further
purposes. We have

$\sup_{(H,\sigma)\in\mathcal{D}} \mathbb{P}^n_{H,\sigma}[n^{1/(4H+2)}|\hat{H}_n - H| \geq M]$

$\quad \geq \tfrac{1}{2}P^{0,n}[n^{1/(4H_0+2)}|\hat{H}_n - H_0| \geq M] + \tfrac{1}{2}P^{1,n}[n^{1/(4H_1+2)}|\hat{H}_n - H_1| \geq M] + u_n,$

where $u_n \to 0$ as $n \to \infty$ by (ii) of Proposition 5. By definition
of $P^{i,n}$ and by taking n large enough, it suffices to bound from below

(21) $\tfrac{1}{2}\int_{\mathcal{X}^n} (P^n_{\xi^{0,n}(\omega)}[A^0] + P^n_{\xi^{1,n}(\omega)}[A^1])\,\mathbb{P}^n(d\omega),$

where $A^i = \{n^{1/(2+4H_i)}|\hat{H}_n - H_i| \geq M\}$. By (iv) of
Proposition 5, for n large
enough, 

Thus (21) is equal to half the quantity 

for any A > 0, and where A"" denotes the set of a; S A'" such that 

n||e°'"(r"^)-e''"(^)i, ||C°'"(T"^)||h. and 
are bounded by r > 0. We will next need the two following technical lemmas. 



Lemma 2. For any r > 0, there exists c(r) > such that, on , 
Lemma 3. For large enough n, we have 



> e' 



>P"[A';^]-e-^-l + c72. 
Applying successively Lemmas 2 and 3, we derive the lower bound
e-^c(r)(P" [Af;^] - - 1 + 12). 
Thus, Theorem 2 is proved as soon as we verify 
(22) lim liminfP"[Af,!'] = 1. 



r— >oo n— ►oo 



It suffices then to take A and r large enough. By (i) and (iii) of Proposition 5, 
(22) only amounts to showing the tightness of ||C'^'"(T'"u;)||->^a under P". For 
L, L' > 0, we have 



p"[|ie°'"(T"(..))llH" >^] = y^^i{r,.Hii,.>,}^^(u^)p"(du.) 

dr"P" 



<L'P"[r'"(a;)||H.>L]+P 
<L'P"[||^0'"(a;)||H">L] + (L')-^ 



by Chebyshev's inequality. The tightness of \\^^'''^{T''^{uj))\\f{a then follows 
from the tightness of ||'^'^'"'||-Ha . The proof of Theorem 2 is complete. 

6.3. Proof of Lemmas 2 and 3. 

6.3.1. Proof of Lemma 2. Since $H_0 < H_1$, it suffices to bound from below

$$\mathbb{P}^n_{\xi^{0,n}(T^n\omega)}\big[n^{1/(4H_0+2)}|\widehat H_n - H_0| > M\big] + \mathbb{P}^n_{\xi^{1,n}(\omega)}\big[n^{1/(4H_0+2)}|\widehat H_n - H_1| > M\big].$$



Let

$$d_{\mathrm{test}}(\mu,\nu) := \sup_{0\leq f\leq 1}\Big|\int f\,d\mu - \int f\,d\nu\Big|$$

denote the test distance between the probability measures $\mu$ and $\nu$. The last
term above is thus greater than

$$1 - d_{\mathrm{test}}\big(\mathbb{P}^n_{\xi^{0,n}(T^n\omega)},\,\mathbb{P}^n_{\xi^{1,n}(\omega)}\big).$$

Now since $M < 1/(2I)$ and by our choice for $H_0$ and $H_1$, one of the two events
in the expectation above must occur with probability one. Using the fact
that $d_{\mathrm{test}}(\mu,\nu) = \frac12\|\mu - \nu\|_{TV}$, the last term above is further bounded below
by

$$1 - \tfrac12\big\|\mathbb{P}^n_{\xi^{0,n}(T^n\omega)} - \mathbb{P}^n_{\xi^{1,n}(\omega)}\big\|_{TV}.$$

We conclude by Proposition 4 together with the fact that $\omega \in \mathcal{X}^n_r$.
6.3.2. Proof of Lemma 3. It suffices to bound from below



-{T"uj)<e-^ 



P"(da;) 



since $T^n\mathbb{P}^n$ and $\mathbb{P}^n$ are equivalent. We now replace the measure $T^n\mathbb{P}^n$ in
the integral above by $\mathbb{P}^n$ with an error controlled by the test distance; the
lower bounds become



P"[A', 



dP" 



:P'^[;f; 



dP" 



(itest(P",r"P") 



TV- 



We conclude by the Chebyshev inequality and Proposition 5(iv).



7. Proof of Proposition 5. The proof of Proposition 5 relies on the con- 
struction of the fractional Brownian motion given by Meyer, Sellan and 
Taqqu [20]. In Section 7.1 we recall the main steps of the construction and 
how to apply it to our framework. In Section 7.2 we construct the sequence
of spaces $(\mathcal{X}^n, \mathfrak{X}^n, \mathbb{P}^n)$. The proof of (i)-(iv) is delayed until Sections 7.3.1-
7.3.4.



7.1. A synthesis of fractional Brownian motion. Consider a scaling function
$\phi$ whose Fourier transform has compact support as in Meyer's book [19],
with the corresponding wavelet function $\psi \in \mathcal{S}(\mathbb{R})$. In [20] the authors
introduced, for $d \in \mathbb{R}$, the following differentials of order $d$ (via their Fourier
transform):

D^i^is) := (isf^is), ^Hs) ■■= 

where a determination of the argument on $\mathbb{C} \setminus \mathbb{R}_-$ with values in $(-\pi,\pi)$
is chosen. It is shown that the above formula is well defined and that
$D^d\psi, \phi^{d,\Delta} \in \mathcal{S}(\mathbb{R})$. Define further, for $d = 1/2 - H \in (-1/2, 1/2)$,

$$\psi^H(t) := \int_{-\infty}^{t} D^d\psi(u)\,du = D^{d-1}\psi(t), \qquad \psi^H_{j,k}(t) := 2^{j/2}\psi^H(2^jt - k),$$

Jo 

In their Theorem 2, Meyer, Sellan and Taqqu [20] prove the following almost
sure representation of fractional Brownian motion (on an appropriate
probability space and uniformly over compact sets of $\mathbb{R}$):



oo oo 



fc=— oo j=OA;=— oo 

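where $\varepsilon^H_k = \sum_{l=0}^{\infty}\gamma_l\,\varepsilon'_{k-l}$ and $(1-r)^d = \sum_{k=0}^{\infty}\gamma_k r^k$ near $r = 0$. The $\varepsilon'_k$,
$k \in \mathbb{Z}$, $\varepsilon_{j,k}$, $j \geq 0$, $k \in \mathbb{Z}$, are i.i.d. $\mathcal{N}(0,1)$ random variables. Note that $\gamma_k =
O(k^{-1-d})$, so the series above converges in quadratic mean and the time
series obtained, $(\varepsilon^H_k)_k$, has spectral density equal to $|2\sin(\frac{s}{2})|^{1-2H}$. The
scaling

gives yet another representation for $W^H$,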

00 

fc=— 00 

(23) 



00 00 



3=30 k=—oo 

Comparing with other decompositions of fractional Brownian motion (e.g., 
Ciesielski, Kerkyacharian and Roynette [4] and Benassi, Jaffard and Roux [2]), 
a particular feature is that the random variables appearing in the high fre- 
quency terms 

$$\sum_{j=j_0}^{\infty}\sum_{k=-\infty}^{\infty} 2^{-j(H+1/2)}\{\psi^H_{j,k}(t) - \psi^H_{j,k}(0)\}\varepsilon_{j,k},$$

are independent and independent of the low frequency terms. 
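The filter coefficients $\gamma_k$ of the expansion of $(1-r)^d$ recalled above, their decay, and the resulting spectral density can be checked numerically. The sketch below is only illustrative: the function name, the value $H = 0.7$, the truncation level and the frequency $s = 1$ are arbitrary choices made here, and the code assumes the reading $d = 1/2 - H$ with coefficients of $(1-r)^d$ decaying like $k^{-1-d}$.

```python
import math
import numpy as np

def binomial_coeffs(d, n_terms):
    """Coefficients g[k] of (1 - r)**d = sum_k g[k] * r**k, for k < n_terms."""
    g = np.empty(n_terms)
    g[0] = 1.0
    for k in range(1, n_terms):
        # ratio of consecutive coefficients of the binomial series
        g[k] = g[k - 1] * (k - 1 - d) / k
    return g

H = 0.7                      # illustrative Hurst parameter (assumption)
d = 0.5 - H
g = binomial_coeffs(d, 5000)

# Decay: g[k] * k**(1 + d) approaches 1 / Gamma(-d) as k grows.
k_last = g.size - 1
decay_ratio = g[k_last] * k_last ** (1 + d) * math.gamma(-d)

# Spectral density of the filtered white noise at one frequency s:
# squared modulus of the truncated transfer function versus the closed form.
s = 1.0
k = np.arange(g.size)
transfer = np.sum(g * np.exp(-1j * k * s))
spectral_from_filter = abs(transfer) ** 2
spectral_closed_form = abs(2 * np.sin(s / 2)) ** (1 - 2 * H)
```

The truncated sum is only an approximation of the full series, so the two spectral values agree up to a truncation error that shrinks as more coefficients are kept.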

A drawback is that the basis used depends on H and the functions ap- 
pearing in the decomposition are not compactly supported. However, one 
can explore the properties of this basis. In [20], Meyer, Sellan and Taqqu 
show that the derivative of the initial wavelet function generates a multires- 
olution analysis and state the following results. 

Lemma 4 (Lemma 8 in [20]). (1) There exist smooth $2\pi$-periodic functions
$U_d$ and $V_d$ such that

$$\widehat{\phi^{d,\Delta}}(s) = U_d(s/2)\,\widehat{\phi^{d,\Delta}}(s/2), \qquad \widehat{D^d\psi}(s) = V_d(s/2)\,\widehat{\phi^{d,\Delta}}(s/2).$$

These "filters" $U_d$ and $V_d$ vanish respectively in a neighborhood of $\pi$
and 0.
(2) Let $(c_k)_{k\in\mathbb{Z}} \in l^2(\mathbb{Z})$. Then the function $\sum_k c_k 2\phi^{d,\Delta}(2t - k)$ can be
expressed with the basis $\phi^{d,\Delta}(t - k)$ and one level of detail,

(24) $\sum_k c_k 2\phi^{d,\Delta}(2t - k) = \sum_k a_k\phi^{d,\Delta}(t - k) + \sum_k b_k D^d\psi(t - k),$

where $(a_k)_{k\in\mathbb{Z}}$ and $(b_k)_{k\in\mathbb{Z}} \in l^2(\mathbb{Z})$. Moreover, $a$ and $b$ are given as follows:
denoting by $A$, $B$ and $C$ the $2\pi$-periodic extensions of the discrete Fourier
transforms of $a$, $b$ and $c$, we have

(25) $A(s) = -4^{-d}\,[V_d(s/2 + \pi)C(s/2) - V_d(s/2)C(s/2 + \pi)]\,e^{is/2},$

(26) $B(s) = -4^{-d}\,[-U_d(s/2 + \pi)C(s/2) + U_d(s/2)C(s/2 + \pi)]\,e^{is/2}.$

From these properties we can show the following lemma, which will prove
useful in controlling in $\mathcal{H}^\alpha$ norm the error made when we truncate the
expansion. It also explores some properties of the basis when $H$ varies.

Lemma 5. Let H G (0, 1). (i) If and uj^k ore two sequences such that 
l^^fcl < c(l + |A;|)'^ and \uj^k\ 1^ c{l + + \k\Y , then, for any a £ [0,1) and 
M >0, there exists c{a,M) such that, for all Jq, 



E E \Kk^f,k\\H <c{a,M)2-^^\ 



3=30 |fc|>2i+l 



E h.ej^,,|L ^<c(a,M)2"*^^o. 



lfc|>2J0+i 

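(ii) For all $M > 0$, there exists $c(M)$ such that, for all $\varepsilon > 0$ with $H + \varepsilon <
1$ and $t \in \mathbb{R}$,

(27) $|\psi^{H+\varepsilon}(t) - \psi^H(t)| \leq c(M)\,\dfrac{\varepsilon}{(1+|t|)^M}.$

(iii) For all $\varepsilon > 0$ with $H + \varepsilon < 1$, we have, for all $k \in \mathbb{Z}$,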

(28) 

= ^aKe)ef+; + ^6,(£){V>o%(t) - <fc+z(0)}, 

where the coefficients $a_l(\varepsilon)$ and $b_l(\varepsilon)$ are such that, for all $M$, there exists
$c(M)$ such that, for all $\varepsilon$,

(29) $\max\{|a_l(\varepsilon)|, |b_l(\varepsilon)|\} \leq \varepsilon\,c(M)(1 + |l|)^{-M}.$

Moreover, the $2\pi$-periodic function with Fourier coefficients $b_l(\varepsilon)$ vanishes
in some neighborhood of zero independent of $\varepsilon$.

The proof of Lemma 5 may be found in the Appendix of [11]. 

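7.2. The space $(\mathcal{X}^n, \mathfrak{X}^n, \mathbb{P}^n)$. Let us recall that $H_1 = H_0 + \varepsilon_n$ where
$\varepsilon_n = I^{-1}n^{-1/(2+4H_0)}$, $j_0 = [\log_2 n^{1/(1+2H_0)}]$ and $\sigma_1 = \sigma_0 2^{j_0\varepsilon_n}$.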

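7.2.1. We take for $\mathcal{X}^n$ an infinite product of real lines, endowed with
the product sigma field $\mathfrak{X}^n$,

$$\mathcal{X}^n := \Big(\bigotimes_{k=-2^{j_0+1}}^{2^{j_0+1}} \mathbb{R}\Big) \otimes \Big(\bigotimes_{j=j_0}^{\infty}\ \bigotimes_{|k|\leq 2^{j+1}} \mathbb{R}\Big) =: \mathcal{X}^n_e \otimes \mathcal{X}^n_d.$$

An element of $\mathcal{X}^n$ is denoted by $\omega = (\omega^e, \omega^d)$ with $\omega^e = (\omega^e_k)_{|k|\leq 2^{j_0+1}}$ and
$\omega^d = (\omega^d_\lambda)_{\lambda=(j,k);\, j\geq j_0,\, |k|\leq 2^{j+1}}$. The projections on the coordinates are denoted
by $\varepsilon_k(\omega) = \omega^e_k$ for $|k| \leq 2^{j_0+1}$ and $\varepsilon_{j,k}(\omega) = \omega^d_{j,k}$ for $j \geq j_0$, $|k| \leq 2^{j+1}$.

On $\mathcal{X}^n$ we define the probability measure $\mathbb{P}^n := \mathbb{P}^n_e \otimes \mathbb{P}^n_d$, where $\mathbb{P}^n_e$ is
the unique probability on $\mathcal{X}^n_e$ which makes the sequence $(\varepsilon_k)$ a centered
Gaussian stationary time series with spectral density $|2\sin(\frac{s}{2})|^{1-2H_0}$. The
probability measure $\mathbb{P}^n_d$ is the unique probability on $\mathcal{X}^n_d$ that makes the
sequence $(\varepsilon_{j,k})$ i.i.d. $\mathcal{N}(0,1)$.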

7.2.2. As suggested by Section 7.1, we define an approximation of $\sigma_0W^{H_0}$
by keeping a finite number of coefficients at each scale,

|A:l<2J0+i 

(30) 

j>jo |fc|<2J+i 

Denote by $T^{n,1}$ a linear mapping from $\mathcal{X}^n_e$ to itself such that, under the
measure $T^{n,1}\mathbb{P}^n_e$, the coordinates $(\varepsilon_k)$ form a centered Gaussian time series
with spectral density $|2\sin(\frac{s}{2})|^{1-2H_1}$. Let

(31) $\varepsilon'_k(\omega) := \varepsilon_k(T^{n,1}\omega).$

We then define on the same space an approximation for $\sigma_1W^{H_1}$. A natural
choice would be to take again (30) with $(\sigma_1,H_1)$ and $\varepsilon'_k$ instead of $(\sigma_0,H_0)$
and efc. We proceed a little bit differently: we replace all the 0j(,\ by their 

truncated expansion on using relation (28). We then 

reorder the sums and finally drop the terms with index $k$ corresponding to
the localization outside $[-2,2]$. The reason is that we want to use the
same basis as in $\xi^{0,n}$ for the low frequency terms.

This leads us to the following approximation for $\sigma_1W^{H_1}$:

|fc|<2Jo+i 



(32) 



+ cxi2-^o(^^+V2)ej;.>(t) 

|;|<23o+i |fc|<2Jo+i 

+ Y cTi2-^o(^^+V2){^i/o(,)_^i.o(o)} Y k^.ei 

|«|<2-'0+i |fc|<2J0+i 

+ E E -i2"^-(^^^^/^){<.^(0-<.^(o)}e,,., 

j>jo |fc|<2J+l 

where the coefficients $a = a(\varepsilon)$ and $b = b(\varepsilon)$ are defined by (28) with $H = H_0$,
$H + \varepsilon = H_1$.

7.2.3. The last step is the construction of the mapping $T^n$ from $(\mathcal{X}^n, \mathfrak{X}^n)$
to itself. Recalling (iii) of Proposition 5, we see that $T^n$ should transform
outcomes of $\xi^{0,n}$ into approximate outcomes of $\xi^{1,n}$. Thus, we define the
action of $T^n$ on the random space $(\mathcal{X}^n, \mathfrak{X}^n)$ by making the low frequency
terms of $\xi^{0,n}(T^n\omega)$ exactly match the low frequency terms of $\xi^{1,n}(\omega)$.

We define T'^'" on as the linear map such that 

(33) e«(T^'"a;)= ^ a/_fcefe(w) + e«(w), 

fc|<2J0+i 

(34) e,„,i(r2'"a;)= ^ bi_kek{u;) + ej,^i{Lo), 

fc|<2J0+i 

(35) ej^iiT^'^u) = ej,i{u) if j > h- 

We remark that the matrix of this linear map in the canonical basis of $\mathcal{X}^n$ is,
of course, infinite, but $T^{n,2}$ leaves invariant the finite-dimensional subspace
$\mathcal{X}^n_e \otimes (\bigotimes_{|k|\leq 2^{j_0+1}} \mathbb{R}) \otimes (0, 0, \ldots) \subset \mathcal{X}^n$ and is the identity on a supplementary
space. On the finite-dimensional subspace its matrix is $\mathrm{Id} + K^n$, where $K^n$
is the square matrix of size $2[2^{j_0+2} + 1]$,

(36) $K^n := \begin{pmatrix} (a_{l-k})_{|l|,|k|\leq 2^{j_0+1}} & 0 \\ (b_{l-k})_{|l|,|k|\leq 2^{j_0+1}} & 0 \end{pmatrix}.$

Finally, we set

(37) $T^n := T^{n,2} \circ T^{n,1},$

where we denote again by $T^{n,1}$ the extension of $T^{n,1}$ (previously defined
only on $\mathcal{X}^n_e$) to $\mathcal{X}^n$ such that it is the identity on $\{0\} \otimes \mathcal{X}^n_d$.
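The block structure of the map defined by (33)-(35) on the finite-dimensional subspace can be made concrete with a small numerical sketch. Everything below is illustrative: the function name is hypothetical, the coefficient sequences `a_hyp` and `b_hyp` are made-up sequences with fast polynomial decay in the spirit of (29) and not the coefficients of (28), and the zero blocks reflect the assumption that the new coordinates depend on the old ones only through the low frequency coordinates $\varepsilon_k$.

```python
import numpy as np

def build_map_matrix(a, b, j0):
    """Matrix Id + K of the map (33)-(34) on the finite-dimensional subspace.

    a, b: vectorized callables returning the coefficients a_l and b_l.
    The first block of coordinates is (eps_k), the second is (eps_{j0,l}),
    both indexed by |k| <= 2**(j0 + 1).
    """
    n_half = 2 ** (j0 + 1)
    idx = np.arange(-n_half, n_half + 1)     # indices |k| <= 2^(j0+1)
    m = idx.size                             # m = 2^(j0+2) + 1
    lag = idx[:, None] - idx[None, :]        # lag[l, k] = l - k
    A = a(lag)                               # block (a_{l-k})
    B = b(lag)                               # block (b_{l-k})
    K = np.block([[A, np.zeros((m, m))],
                  [B, np.zeros((m, m))]])
    return np.eye(2 * m) + K

# Hypothetical coefficients of size eps with polynomial decay.
eps = 0.05
a_hyp = lambda l: eps * (1.0 + np.abs(l)) ** -4.0
b_hyp = lambda l: 0.5 * eps * (1.0 + np.abs(l)) ** -4.0

j0 = 2
M = build_map_matrix(a_hyp, b_hyp, j0)
m = 2 ** (j0 + 2) + 1
```

Because the matrix is block lower triangular with an identity block in the lower right corner, its determinant reduces to that of the upper left block $\mathrm{Id} + (a_{l-k})$, which is one way to see that the map is invertible when the coefficients are small.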
As announced, the choice of $T^n$, with (30)-(35) and the fact that $\sigma_1 2^{-j_0H_1} =
\sigma_0 2^{j_0\varepsilon_n}2^{-j_0H_1} = \sigma_0 2^{-j_0H_0}$, yields

$$\begin{aligned}
&\xi^{1,n}(\omega) - \xi^{0,n}(T^n(\omega)) \\
&\quad= \sum_{j\geq j_0}\sum_{|k|\leq 2^{j+1}} \sigma_1 2^{-j(H_1+1/2)}\{\psi^{H_1}_{j,k}(t) - \psi^{H_1}_{j,k}(0)\}\varepsilon_{j,k}(\omega) \\
&\qquad- \sum_{j\geq j_0}\sum_{|k|\leq 2^{j+1}} \sigma_0 2^{-j(H_0+1/2)}\{\psi^{H_0}_{j,k}(t) - \psi^{H_0}_{j,k}(0)\}\varepsilon_{j,k}(\omega).
\end{aligned} \tag{38}$$

We now have completed the setup of $(\mathcal{X}^n, \mathfrak{X}^n, \mathbb{P}^n)$ and it now remains to
prove that Proposition 5 holds. Let us stress that the choice of $j_0$ is for that
matter crucial. Clearly, Proposition 5(iii) requires that $j_0$ be large enough.
Meanwhile, Proposition 5(iv) requires that the number of components of
$\omega$ on which $T^n$ is different from the identity be as small as possible, which
requires that $j_0$ be not too large. Since the proof is rather technical and
quite long, we only sketch it here. A detailed proof may be found in [11].

7.3. Sketch of the proof of Proposition 5. 

7.3.1. Property (i). We see that the representation (23) and our choice
(30) only differ by the terms corresponding to locations outside $[-2,2]$. With
the help of Lemma 5(i), it can be deduced that on some probability space we
have $\|\xi^{0,n} - \sigma_0W^{H_0}\|_{\mathcal{H}^\alpha} \leq c(\omega)2^{-Mj_0}$, where $M$ is arbitrarily large and $c(\omega)$
is some random variable with finite moments coming from the randomness
of the coefficient in the expansion (23). A similar bound may be obtained
for $\|\xi^{1,n} - \sigma_1W^{H_1}\|_{\mathcal{H}^\alpha}$. Then the property (i) of Proposition 5 follows from
the almost sure smoothness property of the fractional Brownian motion.

7.3.2. Property (ii). Proposition 4 gives immediately an almost sure re- 
lation on the conditional laws: ||P^,,„ - P^J.^i^, ||tv < cn^/^H^*-" _ aiW"* ||^^ 

for $i = 0, 1$. Combining with the study of the difference $\xi^{i,n} - \sigma_iW^{H_i}$, this
shows that this total variation distance is bounded by $c(\omega)n^{1/2}2^{-Mj_0/2}$. We
are then able to deduce that the same bound holds for the unconditional
laws

$$\|\mathbb{P}^{i,n} - \mathbb{Q}^{i,n}\|_{TV} \leq c\,n^{1/2}2^{-Mj_0/2}.$$

Since $M$ is arbitrarily large, property (ii) of Proposition 5 follows and it is
clear that this property is not crucial for the calibration of $j_0$.

7.3.3. Property (iii). We write (38) as
where

j>jo |fc|<2:'+l 

i>io |fc|<2J+i 

But Lemma 5(ii) implies that the difference $\psi^{H_1}_{j,k} - \psi^{H_0}_{j,k}$ is a function with
uniform norm bounded by $c2^{j/2}\varepsilon_n$ and well localized around $k/2^j$. This
enables us to evaluate the sum with respect to $k$ in $q_1(t)$ and to deduce (for
precise computations, see [11])

where $\asymp$ means equality in stochastic order. An analogous evaluation is
obtained for $q_2(t)$, using $\sigma_1 2^{-jH_1} - \sigma_0 2^{-jH_0} = \sigma_0 2^{-jH_0}(2^{(j_0-j)\varepsilon_n} - 1)$.

Hence, property (iii) of Proposition 5 follows from $j_0 = [\frac{1}{2H_0+1}\log_2 n]$,
which implies that $2^{-2H_0j_0}\varepsilon_n^2$ is of order $n^{-1}$.
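The calibration of $j_0$ can be checked by direct arithmetic. The sketch below is illustrative only: the function name and the numerical values of $n$, $H_0$ and $I$ are arbitrary, and it assumes the reading $\varepsilon_n = I^{-1}n^{-1/(2+4H_0)}$ and $j_0 = [\log_2(n)/(1+2H_0)]$ with $[\cdot]$ the integer part.

```python
import math

def calibration(n, H0, I):
    """Return (j0, eps_n) for the assumed calibration of Section 7.2."""
    j0 = math.floor(math.log2(n) / (1 + 2 * H0))
    eps = n ** (-1.0 / (2 + 4 * H0)) / I
    return j0, eps

# Illustrative values (assumptions, not taken from the paper).
n, H0, I = 10**6, 0.7, 2.0
j0, eps = calibration(n, H0, I)

# Quantity driving property (iii): n * 2^(-2 H0 j0) * eps^2 stays bounded,
# i.e. 2^(-2 H0 j0) * eps^2 is of order 1/n.
ratio_iii = n * 2 ** (-2 * H0 * j0) * eps ** 2

# Quantity driving property (iv): 2^j0 * eps^2 stays of order I^(-2),
# so it can be made small by taking I large.
ratio_iv = 2 ** j0 * eps ** 2
```

Because of the integer part, `ratio_iii` lies between $I^{-2}$ and $2^{2H_0}I^{-2}$, and `ratio_iv` between $I^{-2}/2$ and $I^{-2}$, whatever the value of $n$. This is the trade-off described above: a larger $j_0$ would shrink the first quantity but inflate the second.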



7.3.4. Property (iv). Let us focus only on the really delicate part, the
evaluation of the total variation distance. By the triangle inequality, it
suffices to show that $\|\mathbb{P}^n - T^{n,1}\mathbb{P}^n\|_{TV}$ and $\|T^{n,2} \circ T^{n,1}\mathbb{P}^n - T^{n,1}\mathbb{P}^n\|_{TV}$ can
be made arbitrarily small for an appropriate choice of $I$ and for large enough
$n$. Hence, we need to compare centered Gaussian measures. Let us start by
evaluating the distance between the measures $\mathbb{P}^n$ and $T^{n,1}\mathbb{P}^n$.

Recalling the construction of $\mathcal{X}^n$ in Section 7.2.1, these two measures only
differ on the space of low frequencies $\mathcal{X}^n_e$, and the covariance matrix of $\mathbb{P}^n$
on this space of dimension $m = 2^{j_0+2} + 1$ is the Toeplitz matrix $T_m(f_0)$ with
the function $f_0(s) = |2\sin(\frac{s}{2})|^{1-2H_0}$ [the notation $T_m(f)$ is for the matrix
with entries $T_m(f)_{k,l} := \frac{1}{2\pi}\int_{-\pi}^{\pi} f(s)e^{i(k-l)s}\,ds$ for $1 \leq k,l \leq m$]. The
Gaussian measure $T^{n,1}\mathbb{P}^n$ has, on the same space, covariance matrix $T_m(f_1)$ with
$f_1(s) = |2\sin(\frac{s}{2})|^{1-2H_1}$. Then some considerations of Gaussian measures
enable us to control, here, the distance between these two measures by the
trace bound $\mathrm{Tr}([T_m(f_1)T_m(f_0)^{-1} - \mathrm{Id}]^2)$. Now the proof consists in making
the following sequence of approximations rigorous:

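$$\begin{aligned}
\mathrm{Tr}\big([T_m(f_1)T_m(f_0)^{-1} - \mathrm{Id}]^2\big)
 &\approx \mathrm{Tr}\Big(T_m\Big(\Big\{\frac{f_1}{f_0} - 1\Big\}^2\Big)\Big) \\
 &\approx \frac{m}{2\pi}\int_{-\pi}^{\pi}\Big(\frac{f_1(s)}{f_0(s)} - 1\Big)^2 ds \\
 &\leq c\,2^{j_0}\varepsilon_n^2 \leq c\,I^{-2}.
\end{aligned}$$
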

The first approximation above expresses the quasi-homomorphism property
of the Toeplitz operator $f \mapsto T_m(f)$, while the second one is a kind of Szegő
theorem. The third approximation is obtained since $\|\frac{f_1}{f_0} - 1\|_2 \leq c\,\varepsilon_n$, where the
$L^2$-norm is taken over $[-\pi,\pi]$. Again, a detailed proof is presented in [11],
where we use the method developed in Dahlhaus [5] and Fox and Taqqu [8]
to deal with Toeplitz matrices (and Brockwell and Davis [3] too for more
elementary results).

Finally, the control of $\|T^{n,2} \circ T^{n,1}\mathbb{P}^n - T^{n,1}\mathbb{P}^n\|_{TV}$ is obtained by similar
techniques (see [11] for details). The property (iv) of Proposition 5 is proved.
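The trace criterion and the Szegő-type integral it is compared with can both be evaluated numerically for moderate $m$. The sketch below is an illustration under stated assumptions and not the method of [11]: the function names are hypothetical, the values $H_0 = 0.7$, $m = 65$ and the two values of $\varepsilon$ are arbitrary, and the Toeplitz entries are computed from the classical autocovariance recursion of a fractionally differenced white noise with unit innovation variance, which matches the normalization $T_m(f)_{k,l} = \frac{1}{2\pi}\int f(s)e^{i(k-l)s}ds$.

```python
import math
import numpy as np

def toeplitz_from_spectral_power(H, m):
    """T_m(f) for f(s) = |2 sin(s/2)|**(1 - 2H), with H in (0, 1)."""
    delta = H - 0.5                      # f(s) = |2 sin(s/2)|**(-2 * delta)
    gam = np.empty(m)
    gam[0] = math.gamma(1 - 2 * delta) / math.gamma(1 - delta) ** 2
    for h in range(1, m):
        # autocovariance recursion of fractionally differenced white noise
        gam[h] = gam[h - 1] * (h - 1 + delta) / (h - delta)
    idx = np.arange(m)
    return gam[np.abs(idx[:, None] - idx[None, :])]

def trace_criterion(H0, eps, m):
    """Tr([T_m(f1) T_m(f0)^{-1} - Id]^2) with H1 = H0 + eps."""
    T0 = toeplitz_from_spectral_power(H0, m)
    T1 = toeplitz_from_spectral_power(H0 + eps, m)
    R = np.linalg.solve(T0, T1).T - np.eye(m)   # T1 T0^{-1} - Id (symmetry)
    return np.trace(R @ R)

def szego_integral(eps, m, n_cells=200000):
    """(m / 2 pi) * integral over [-pi, pi] of (f1/f0 - 1)^2, midpoint rule."""
    h = 2 * np.pi / n_cells
    s = -np.pi + (np.arange(n_cells) + 0.5) * h  # midpoints, s = 0 never hit
    ratio = np.abs(2 * np.sin(s / 2)) ** (-2 * eps)
    return m / (2 * np.pi) * np.sum((ratio - 1) ** 2) * h

H0, m = 0.7, 65                # illustrative values (assumptions)
t_big = trace_criterion(H0, 0.01, m)
t_small = trace_criterion(H0, 0.005, m)
s_big = szego_integral(0.01, m)
s_small = szego_integral(0.005, m)
```

Both quantities scale roughly like $\varepsilon^2$ for fixed $m$, which is the scaling behind the bound $c\,2^{j_0}\varepsilon_n^2$. How close the trace is to the integral at a given finite $m$ is not claimed here; the rigorous comparison is the object of the approximations discussed above.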

Acknowledgments. We are grateful to Yuri Golubev, François Roueff and
Philippe Soulier for helpful discussions and comments.